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Q ■ We study the rate of convergence of the Mallows distance between 

fH , the empirical distribution of a sample and the underlying population. 

The surprising feature of our results is that the convergence rate is 
slower in the discrete case than in the absolutely continuous setting. 
We show how the hazard function plays a significant role in these calcu- 
lations. As an application, we recall that the quantity studied provides 

/\^ • an upper bound on the distance between the bootstrap distribution of 

3 . a sample mean and its true sampling distribution. Moreover, the con- 

venient properties of the Mallows metric yield a straightforward lower 
bound, and therefore a relatively precise description of the asymptotic 
performance of the bootstrap in this problem. 
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1 Introduction and main results 

Different problems in Probability and Statistics naturally lead to different 
choices of probability metric. One such choice is the Mallows distance, also 
known as the Wasserstein or Kantorovich distance. This metric has found 



extensive applications to a wide variety of fields; see JRachevI (|l984 ) for a 
review. 

Definition 1.1 For r > 1, let Tr denote the set of distribution functions F 
satisfying j_ \x\^ dF{x) < oo. For F,G & Tr, the Mallows metric dr{F,G) 
is defined by 

dr{F,G) = inf {E|X-Fr}^^^ 

where Txy is the set of all joint distributions of pairs of random variables X 
and Y whose marginal distributions are F and G respectively. We also write 
dr{X,Y) fordr{F,G), where this will cause no confusion. 

The empirical process is a fundamental quantity of interest. Though often 
not explicitly recognised as such, the Mallows distance between the empirical 
distribution and th e underlying populat i on has arise n in the work of s everal 
autho rs, including ICsorgo and HorvathI ( 19901 1 and Idel Barrio et al.l ( 19991 



2nnnh . 



Suppose Xi, . . . Xn are independent random variables, each having distribu- 
tion function F with mean fi and finite variance a^ > 0, and let F„ denote 
the empirical distribution function of the sample, given by 

1 " 

'■ • 1 

1=1 

The main contributions of this paper are threefold: 

1. In Theorem 12. 61 we show that in the case of a discrete underlying pop- 
ulation of finite support, n^^^d2{Fn, F) converges to an explicit non- 
degenerate limiting distribution. We contrast this with the n^/^ nor- 
malisation required by the previously cited authors in the absolutely 
continuous case. 



2. In Sec tion EI we study the ta i l cond itions required bv lCsorgo and HorvathI 
(1990) and ldel Barrio et al] ((2000) for the convergence of n^/^(i2(-^n, F) 
in the absolutely continuous case. In particular, by considering the 



hazard function, we show that one of the conditions of del Barrio et al 



( 2000.') is redundant, and the statement of Theorem 2.1 of lCsorgo and Horyath 



19901 ) may be simplified. 



3. Section mis devoted to an applic ation of these results. We recall the 
calculation of IShao and Tul ( 19951 ). showing that d2{Fn, F) provides an 
upper bound on the Mallows distance between the bootstrap distribu- 
tion of the sample mean and its true sampling distribution. We give a 
straightforward lower bound on this latter quantity, yielding conditions 
under which the upper and lower bounds are of the same order. 



2 Convergence rates and limiting distribu- 
tions for dr{Fn^F) 



First, recal l the following two lemmas about dr-, which are proved in iMaioi 
( 19781 ) and lBickel and FreedmanI ( 19811 ) respectively. 



Lemma 2.1 For F,G E T,-, the infimum in Definition \l.l\ is attained by the 
following construction: let U ~ t/(0, 1), and set X = F^^{U), Y = G^^{U), 
where, for example, F~^{p) = inf{x G M : F{x) > p}. Thus 



d,{F,G) = (^^ \F-\p) - G-\p)rdp^ 



l/r 



Lemma 2.2 // (F„) G JF and F G JF, then dr{Fn, F) -^ as n -^ oo if and 
only if, for every bounded, continuous function g -.M. ^ R, we have both 

g{x)dFn{x)= / g{x)dF{x); 

-OO J-oo 

/CXD /"CXD 

\x\''dFn{x) = / \x\''dF{x). 
-OO J — OO 

Thus, convergence in the Mallows metric dr is equivalent to convergence in 
distribution together with convergence of the rth absolute moments. 



It follows immediately by Lemma 12.21 and the strong law of large numbers 
that d2{Fn, -F) — > almost surely as n ^ oo. It is important to remark that 
although we can calculate a rate of convergence of the two parts above (that 
is, convergence in distribution and of the rth absolute moment), this will 
not help us find a rate of convergence of d2{Fn,F), and we must use other 
techniques. 

When F has a density /, ICsorgo and HorvathI (jlQQflf ) and Idel Barrio et al 



tennCl ) give conditions under which n^'^d2{Fn, F) converges to a nondegen- 
erate limiting distribution. A simple version of such results is given in The- 
orem 12.41 below. However, these results do not cover the case of a discrete 
underlying population, which is studied later, in Theorem 12.61 

In order to prove Theorem Y2A\ we need to introduce some notation. Let 

D = D[0,1] denote the space of left-continuous, real- valued functions on 

[0, 1] possessing right limits at each point. We may equip D with the uniform 

norm 

\\x-y\\oo= sup \x{p)-y{p)\. 
pe[o,i] 

A small complication arises from the fact that the normed space {D, \\ ■ ||oo) is 
non-separable, and the a-algebra, V, generated by the open balls is strictly 
smaller than the Borel a-algebra, I^Borei, gener ated by t h e ope n sets. This 
creates measurability problems, as explained in IChibisovl (19651), which lead 



us to work with the space {D,V, \\ ■ ||oo)- A consequence of using the ball 
cr-algebra is that we mus t make a sl i ght n iodification to the notion of weak 
convergence, in line with iBillingslevI (19991), p. 67: 



Definition 2.3 If(Yn)n>o is a sequence of random elements of{D,V, ||- ||oo), 

d° 

we write Yn ^ Yq as n ^>- oo if 

E(/(F„))^E(/(Fo)) 

as n -^ oo, for all bounded, continuous functions f : D ^ W which are 
T> -measurable. 

Throughout, B = (i?(p)) ^^ denotes a Brownian bridge; that is, a zero 
mean Gaussian process with 

Cov{Bip),Biq))=pil-q) 

for p < q. For p G (0, 1), let ^p = F~^{p) = inf{x G M : F{x) > p}. 



Theorem 2.4 Suppose that F has a density f such that f{C,p) is positive and 
continuous forp G [0, 1], and that the limits ^o = limp\o C,p and C,i = \im.pyi C,p 
exist in M. Then 

n'/'d2{F^,F) 4 n' B{pf{F-^)'{pfdp 
as n ^ oo. 



Proof Theorem 1 on pp. 640-641 of IShorack and WeUneii ( 19861 ). together 



with CoroUary 1 on p. 48 of the same book, give that 

mp)ny'{F-\p)-Q^B{p) (1) 

on {D,V, II ■ lloo), as n — > cxD. Now, with probabihty one, B belongs to the 
space (C[0, 1], || ■ ||oo) of continuous real-valued functions on [0, 1] equipped 
with the uniform norm, and moreover this space is separable. We can there- 

d° 

fore apply the version of the continuous mapping theorem for — > convergence 
(Billingsley, 1999, pp. 67-68) to a composition map h{p) = h2{hi{p)) from 
{D,V, II ■ lloo) to R. The individual maps hi : {D,V, \\ ■ ||oo) — * {D,V, \\ ■ ||oo) 
and /i2 : (-D, T>, || ■ ||oo) ^ R are defined by 

hi{GM = ^^ and h^iG) = (^^' G{p) dp^ . 

Observe that the continuity of hi follows from the fact that f{^p) attains its 
(positive) infimum for some p G [0, 1]. We conclude that 



I'/'d^iF^, F) = Q^' n{F-\p) - Q'dp"^ "' ^ l^f^ ^ dp"^ 



as n -^ oo. The result follows on noting that any bounded, continuous 
function from M to M is (Borel) measurable. D 

Theorem 12.41 is not the strongest possible; in particula r, it is not necessary 



for th e un derlying density t o have bounded support. ICsorgo and Horvath 



( 19901 ) and Idel Barrio et al.l (|2000) obtain the same conclusion with condi- 



tions which amount to control of the behaviour of the tails of F. For example, 
Equations (3.16) and (3.14) of ldel Barrio et al.l ( 20001 ) consist of the following 
two conditions: 



Condition 1 As n -^ oo, 

"l/n 



/■1/n pi 

/ {F^\p) - F-^(p))2 ^0 and n {F^\p) - F~\p)Y dp ^ 0. 

Jo J(n-l)/n 



Condition 2 



dp = — r^; ax < oo. 



/o /(i^-Hp))' 7-00 fix) 

A version of Theorem 2.1 of ICsorgo and HorvathI (|l990l ) gives the following: 



Theorem 2.5 Suppose that F has a density f such that f(Cp) is positive 
and continuous for p G (0, 1), and monotone for p sufficiently close to zero 
and one. If Conditions^ and\^hold, then 

1/2 

n 



i'/'d2{F^,F) 4 (^j\{pf{F'')'{pfdp 



as n ^ oo. 



Proposition 1 of Idel Barrio et al.l ( 19991 ) verifies Condition ^ when F is the 



normal distribution. We generalise this proposition in Section El by showing 
that Condition ^ holds when the hazard function diverges in the tails. In 
particular then, Condition^ will be seen to be a consequence of Conditional 

Now, we turn to the case of discrete random variables. Note that in The- 
orem 1221 below, we can understand the limiting distribution as the integral 
between and 1 of a sum of delta functions at the points where F~^{-) 
jumps. Thinking of the limit in this way shows the analogy with the limits 
in Theorem 12.41 and Theorem 12.51 

Theorem 2.6 Let F he the distribution function corresponding to the prob- 
ability mass function given by P(Xi = Xj) = pj, where xi < X2 < ■ ■ ■ < Xm 
and X]jLi Vj = 1; '^'^th coc/i pj > 0. For j = 1, . . . ,m, let qj = pi + . . . + pj. 
Then 

,m-l X 1/2 

as n — >• oo. 
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Figure 1: Graphs of F~^{p) (solid) and F~^{p) (dotted). The dotted hnes 
have been perturbed shghtly to distinguish them from the sohd ones. 



Proof See Figure [T] For j = 1, . . . ,m, define pj = n~^ Yll=i ^{Xi=x } to be 
the empirical proportion of the sample taking the value Xj. Let e = min^ pj/3, 
write qj = pi + . . . + pj, and define the event 

-4 = {\qj -qj\ < e for j = 1, . . . , m}. 

Observe that, by the Dvoretsky-Keifer-Wolfowitz inequality ( Massard . 119901 ). 



(m \ 

I J l^j - ^il > e ) = ^U^V \Fn{x) - F{ 



x)| >e < 2e 



-,-2n.e^ 



Thus, since |F„ ^(p) — F ^{p)\ < Xm — xi, 

dl{Fn, F)1ac < {Xm - XiflA'^ = Op{n 

as n -^ oo. On the other hand. 



1/2^ 



m— 1 



dl{Fn, F)1a = ^ \qj - qj\{,Xj+i - Xjf. 



But (r2^/^(gi — gi), . . . , n^^'^{qm — qm)) -^ Nm{0, S), where the asymptotic 
covariance matrix S has entries Sjj = qi{l — qj) , for 1 < i < j < m. So by 
the continuous mapping theorem, 

m— 1 m—1 

as n ^ oo. Hence 

m—l 

as ri ^ oo. n 

Theorems 12.41 and 12.61 extend to the case of general dr. Note the rates below 



coincide only when r = 1, a case studied bv ldel Barrio et al.l (JOOOb). 

Corollary 2.7 

(a) Suppose that F has a density f such that /(Cp) is positive and continu- 
ous for p G [0, 1], and that the limits ^q = limp\o'Cp ^'^^ ^i = linip/'i Cp 
exist in M. Then 

-1 \ 1/2 

/9 , ^ A ^N d / / 



i'/'dr{F^,F)^n \B{p)\^{F-'y{pydp 



as n -^ oo. 



(b) Let F be the distribution function corresponding to the probability mass 
function given by P(Xi = Xj) = pj, where Xi < X2 < ■ ■ ■ < Xm and 
'^T=iPj = 1; with each pj > 0. Let qj = pi + . . . + pj. Then 

(m-l ^ 1/r 

3=1 ^ 

as n ^ oo. 



Proof The proof of part (a) follows that of Theorem 12 .41 the only difference 
being that the function hi should in this case be defined by 



fdvY ' 

Part {b) mirrors the proof of Theorem 12.61 noting that in general, 

m—l 

i=i 
where the event A was defined in the proof of Theorem 12 .61 D 

3 The hazard function and tail behaviour 

In Section |21 we saw the importance of Conditions 1 and 2 concerning the 
tail behaviour of the distribution function F in determining the rate of con- 
vergence to zero of d^^Fn, F). We now study the nature of these conditions 
in greater detail, using the notion of the hazard function. 

Definition 3.1 For a random variable X with distribution function F and 
continuous density f , we define F{x) = P(X > x), and the two-sided hazard 
function by 

U(^\ _ / fi.^)lF{,x) tfx> ^1/2, 

^^"^^ \ f{x)/Fix) ^/x<6/2. 

The following theorem is the main result of this section, showing the redun- 
dancy of Condition ^ Its proof is deferred until after Lemma 13.41 

Theorem 3.2 If the hazard function satisfies h{x) —> oo as |x| — > oo, then 
Condition U\ holds; that is, 



fl/n pi 

n {F,;:\p)-F-\p)Ydp^O and n (p-^p) - F-\p)Y dp ^ Q 

Jo J{n-l)/n 

as n —>■ OO. Moreover, Condition\^ implies Condition^ 



Let Xt denote a random variable distributed as X|X > t, and consider Var Xt 
(we set Var Xf = if P(X > t) = 0). For example, if X is an exponential 
random variable, then Xt ~ X + 1, and we have Var Xt = Var X. In general, 
this tail variance is closely related to the hazard function, as shown by the 
following lemma. 

Lemma 3.3 Let X be a random variable with hazard function h{x). If t > 
^1/2, then 

with a corresponding result for the left-hand tail. 

Proof 

(a) Proof of the lower bound. Define the function 

sup^>t/(a;) 



kit) 



m 



Note that the density of Xt for x > t is f{x)/F{t), which is at most k{t). 
By comparison with the U[t,t + 1/c] distribution, we see that if a random 
variable has density bounded above by c, then its variance is at least l/(12c^), 
so Var Xt > l/{l2k{ty). Moreover, if x > t, then f{x)/F{t) < f{x)/F{x), 
from which we deduce that k{t) < sup^>t^(x). 

(b) Proof of the upper bound. In fact, we establish the stronger conclu- 
sion that 

EJjX - trMx>t}) ^ 4 

F{t) - inf,.>t/i(x)2- ^^ 

We use arguments based on th ose which estab lish Poincare inequalities, in 
Borovkov and Utevl (Il984f) and ISvsoeval ()l965t ). The simplest possible case 



of the main theorem of lSvsoeval (jl965t ) gives a Hardy inequality, that if G is 



a differentiable function with G{t) = 0, then 

^" f{x)G{xf dxKij^^ 7^^^"^^' dx = AJ^ /(a;)|M! rf^, (3) 
where g{x) = G'{x), provided the integrals exist. 
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[For the sake of completeness, note that Equation (jH)) can be proved as follows. 
Integration by parts gives that 

OO /•OO 

fix)G{xy dx = 2 F{x)G{x)g{x) dx 
t Jt 

OO \ 1/2 / /.OO p/ N2 \ 1/2 

2j^\ / / ^y-^) „/™\2, 



<2 / f{x)G{xydx] / ^-^g{xydx 



fix) 



by Cauchy-Schwarz.] Hence, from Equation Q, 



f{x)G{xfdx < ^- -—- / f{x)g{xfdx, 

and choosing G{x) = x — t, Equation follows. D 

This shows that if h{t) is bounded away from zero for large |t|, then Var Xt < 
OO. This is the case for the normal distribution, where the Mills ratio 
(Shorack and Wellner, 1986, p. 850) gives that h{t) > \t\ for all t. On 
the other hand, if h{t) — » as |t| — ^ oo, then Var Xt -^ oo, as for the log- 
normal distribution, where h{t) ~ t^^ logt as t — i> oo. Finally we remark that 
other tail variance behaviour is possible, in the case where liminft_joo h{t) < 

As the example of the log- normal might suggest, results from reliability the- 
ory provide a link between the hazard function and the finiteness of the 
moment generating function in a neighbourhood of the origin. 

Lemma 3.4 Let X be a random variable with hazard function h{x). If, for 
some t > 0, we have inf |a;|>j h{x) > c, then K{e^^) < oo for \6\ < c. 

Proof We prove the result for the right-hand tail. Observe that for x > t, 

which can be seen by differentiating both sides with respect to x. Thus 

F{x) = F{t)exp(-j^ h{y)dy\. (4) 
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Now the right-hand side of Equation (^ is bounded above by e '^^, which 
is enough to guarantee that the moment generating function is finite for 

ee[o,c). n 

Lemma 13.41 shows that if a random variable X has a moment generating 
function which is infinite other than at the origin, then there exists a sequence 
{xn) tending to infinity such that hm„_>oo h{xn) = 0. If, further, X has a 
decreasing hazard function then we deduce that hma;_,oo h{x) = 0, and hence 
by Lemma f3. 31 that hmi_^oo Var Xt = oo. 

We are now in a position to prove Theorem] 

Proof of Theorem 13.21 We have 



[ {F-\p) - F-\p)Y dp 

J(n-l)/n 



n 

l{n-l)/n 

= n {K^ip) - ^(n-l)/n + i{n-l)/n " F'^ {j>)f dp 

J{n-l)/n 

2 I ^ / (^ C ^2 



< 2( (X(„) - i{n-i)/n) +n {x- ^(„-i)/n) dF{x) 

V JS,(n-l)/n 

(5) 

The second term tends to zero by Equation Q- If ^i < C)0, then the first term 
clearly converges in probability to zero. On the other hand, if ^i = oo, then 
the assumed hypothesis on the hazard function combined with Equation P|) 
imply that for any t > 0, 

Fit + x) 
lim V,/ X = 0. 

x^oo Fix) 



These are is precisely the conditions under which iGalamboa ( 19781) , Theo 



rem 4.1.2, proves convergence in probability of X(„) — ^(„„i)/„ to zero. This 
completes the first part of the proof. 

Note that Condition |21 can be restated in terms of the hazard function; that 
is, by considering the regions x G [^1/2, cxd) and x G (—0x3,^1/2) separately, 
we see that Condition |21 holds if and only if 

1 , 
7— — dx < 00. 

12 



Finiteness of this integral implies that h{x) — > oo as |x| -^ oo, and so by the 
argument above, Condition [T] holds. D 

Note for the normal distribution that h{x) —>■ oo, but, again by the Mills 
ratio. 



I h{x) Ji ^x x^ 



dx > / I -] dx = oo. 



Thus Condition ^ holds (implying Proposition 1 of Idel Barrio et al.l ( 19991 )). 
but not Condition 121 

Finally in this section, we give a partial converse to Theorem 12.51 

Proposition 3.5 If Vai Xt ^ oo as t —> oo, then n^''^d2{Fn, F) — >' oo as 
n -^ oo. 

Proof For i = 1, ...,n, let Oj = n pJ\,F~^{p)dp. We decompose 
nd\{Fn,F) into a random and deterministic part as follows: 

ndl{Fn,F) = n V / ((X(,) - a,f + {F-\p) - a,f) dp 

i=l J(i~l)/n 

= V(X(,)-a,)2 + nV / {F-\p)-a,)'dp 

> (X(,) - a„)2 + n / (F"'(p) - a„)' c^P 

^(n-l)/n 



oo 



= (X(„) - a„) + ra / (x - a„) (iF(2;) 

"'?(n-l)/n 

= (X(„) - a„)2 + Var X^^^^^^^ . 
Ignoring the random part, the result is immediate. D 



4 Application to the bootstrap 



The b ootstrap was introduced into Statistics in the landmark paper of lEfron 



( 19791 ) , and gives a very general technique for approximating the distributions 
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of roots (i.e. functions of the sample and parameters of the underlying pop- 
ulation) of interest. The key idea for its use in practice is that of resampling; 
given a sample Xi, . . . , X„, we draw a further sample X^, . . . , X* uniformly 
at random with replacement from the original sample, and perform our cal- 
culations based on this resample. The opportunity to repeat this resampling 
procedure enables the practitioner to mimic drawing additional samples from 
the original population. 

Historically, one of the great early triumphs in the analysis the bootstrap 



was the paper of ISinghl (|1981[ ). One of his main results was to show that 
the bootstrap distribution of a normalised sample mean converges to its true 
sampling distribution at rate 0{n~^), provided the underlying population 
is non-lattice, and sufficiently many moments exist. This improves on the 
0(r7,~^/^) convergence rate of its natural competitor, namely normal approx- 
imation. Singh's theorem was stated in terms of the supremum distance 
between the respective distribution functions. The Mallows metric was fir st 



considered in the context of the bootstrap bv lBickel and FreedmanI (|l98l[ ). 

Recall that Xi, . . . ,X„ are independent random variables, each with distri- 
bution function F with mean fi and finite variance a^ > 0, that Fn is the 
empirical distribution function of the sample and that X„ = n^^ Yl^=i -^i 
is the sample mean. A standard procedure for constructing confidence in- 
tervals for yU is to invert a probability statement concerning a root such as 
n^/^(X„ — /i), whose samphng distribution under F we denote by Hn{F). 
Conditional on Xi, . . . , X„, let X^, . . . , X* be a resample; that is, an inde- 
pendent and identically distributed sample drawn from F„. 

The nonparametric bootstrap estimates the sampling distribution of ra^/^(X„— 
fi) by that of n^/^(X* - X„), where X* = n~^ Yh=i ^i- ^^ other words, con- 
ditional on Xi, . . . ,X„, we approximate if„(-F) by Hn{Fn). The properties 
of the Mallows distance d2 make it suitable for studying the performance of 
the bootstrap approximation in this context. The calculation below follows 
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Shao and TiJ ( 19951 ). and uses results proved in lBickel and FreedmanI ( 19811 ). 



/I - 1 

1 / " _ \ 1/2 



= d2(X*-X„,Xi-/i) 
<rf2(X*,Xi) 

= d,{F^,F). (6) 

Thus, in particular, the distance between the distribution of the root of 
interest, Hn{F), and its bootstrap approximation, Hn{Fn), is stochastically 
dominated by the distance between the true and empirical distributions. The 
fir st inequality and the coriditions under which we obtain equality are studied 



m 



Johnson and SamworthI (|2004f ). 

Having studied the upper bound d2{Fn, F) in previous sections, our final re- 
sult gives a lower bound on the rate of convergence of d2[Hn{Fn),Hn{F)y 
Observe in particular that if the hypotheses of Theorem 12.41 and Proposi- 
tion |1?T] are both satisfied, then the rates of convergence of the two bounds 
are identical. 

Proposition 4.1 

(a) If KXf < oo, then n^^'^d2[Hn{Fn), Hn{F)^ is bounded away from zero 
in probability; in fact, for every e > 0, 

F{n'/'d2{H,,,{F^),H^{F)) > e} ^ 



as n —> oo. 



(b) Fix 6 G (0,2). // E|Xip+'^ = oo, then, with probability one, 
nN{'^+^)(l^i^Hn{Fn),Hn{F)) does not tend to zero; that is 



F{n'/^'+'^d2{H^{F^),H^{F)) ^ O} = 0. 
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Proof For any r > 1 and Fx-, Fy € J>, there exist random variables (X, Y) 
having marginal distribution functions Fx and Fy respectively, and such that 

dr{Fx,Fy) = {^x -yyY'" > \{^xYf'^'-{^YYf'^'\. (?) 

This means that 

— k ^ "^l 

where s^ = n~^ Yli-^i ~ ^nY is the sample variance. Without loss of gener- 
ality, suppose that fi = KXi = 0. 

To prove (a), observe that if E,(Xf) < oo, then n^/^|s — a\ is bounded away 

from zero in probability; in fact, n^/^{s - a) S N{0,(EXf - a*) / (Aa^)) 
(Serfling, 1980, p. 119). 

To prove (6), note that 



1 " 

H^j:iX!-a^)-n^^^^-^^Xl 



-5/(2+5)/ 2 _ 
^* ^* ^ ' ^2/(2+5) 



^2/(2+5) 

Now, applying the law of the iterated logarithm gives that 

Moreover, since E|Xi|2+'^ = oo, b y the c o nvers e to the Marcinkiewicz-Zygmund 



Dytnecc 
f. iLoevd 



strong law of large numbers (cf. iLoevd (|l977l ). p. 255), 






Hence, by Kolmogorov's zero-one law, and since s"^ — (t"^ = {s — cr)(s + cr), 

p(^V(2+5)|^_^|^0)^Q_ 

n 

Note that the strength of the conclusion of part (6) of Proposition 14.11 in- 
creases as 6 decreases. 

16 
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